Ancilla- Assisted Enhancement of Channel Estimation for Low-Noise Parameters 
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In order to make a unified treatment for estimation problems of a very small noise or a very weak 
signal in a quantum process, we introduce the notion of a low-noise quantum channel with one noise 
parameter. It is known in several examples that prior entanglement together with nonlocal output 
measurement improves the performance of the channel estimation. In this paper, we study this 
"ancilla-assisted enhancement" for estimation of the noise parameter in a general low-noise channel. 
For channels on two level systems we prove that the enhancement factor, the ratio of the Fisher 
information of the ancilla-assisted estimation to that of the original one, is always upper bounded 
by 3/2. Some conditions for the attainability are also given with illustrative examples. 

PACS numbers: 03.67.-a, 03.67.Hk, 03.67.Lx, 03.65.Ta 



I. INTRODUCTION 

One of the formidable obstacles for the realization of 
quantum computers is decoherence caused by the cou- 
pling between computational qubits and the environ- 
ment. Recent study of quantum error correction has 
shown that fault-tolerant quantum computing is in prin- 
ciple possible, but it requires that the noise caused by 
the decoherence should be lower than the very stringent 
threshold. Obviously, such a statement has a physical 
meaning only if we have an efficient method for quanti- 
tatively estimating very small noise in quantum devices 
in real experiments. However, if the noise is very small, 
so is our success probability of observing the disturbance 
caused by that noise. This difficulty makes evident the 
demand for the study of optimal quantum estimation of 
very small nose in general quantum channels based on 
well-established quantum estimation theory. 

Quantum estimation theory was instituted by Hcl- 
strom in the late 1960's and has been developed with var- 
ious applications until recently; for standard reviews we 
refer to Helstrom [1] and Holevo [2], and see also Hayashi 
[3] for recent progress. A typical problem of quantum es- 
timation is to ask what is the best observable, possibly 
in an extended system with ancilla, to measure in order 
to estimate the true value of 9 provided that the system 
is known to be in one of the state in a given family {pg}. 
A well-established solution for this problem is given as 
follows. We call an observable A a (locally) unbiased es- 
timator at 6 = 9q if the expectation value Eg [A] of A in 
the state pg satisfies 

Ee [A] = 6 , (1) 
d e Eg[A]\e=e = 1. (2) 
In general there are many unbiased estimators. In order 
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to select a good one, we consider the variance Vg[A] of 
an arbitrary unbiased estimator A in the state pg. Then, 
the quantum Cramer- Rao inequality 



V e [A] > 



1 



J{pe) 



(3) 



holds for any unbiased estimator A at 6, where 

J(pg) = Tr[p e L 2 ] (4) 

is the (quantum) Fisher information defined through the 
symmetric logarithmic derivative (SLD) Lg that is char- 
acterized by the relations 



dgpg = -(Lgpg + pgLg), 
Ll = Lg. 



(5) 

(6) 



The SLD is determined uniquely on the range of pg, i.e., 
Lgpg = L' e pg holds for any two SLDs Lg and Lg. The 
Cramer-Rao inequality (3) follows from a simple appli- 
cation of the Schwarz inequality for the Hilbert-Schmidt 
inner product. From the equality condition for that the 
lower bound Jg l in Eq. (3) is always achieved by any 
observable A satisfying 



Apg = (J e 1 Lg + 9)pg, 



(7) 



see Refs. [1, 2] and for a straightforward derivation see 
Appendix of Ref . [4] . In general, to find an optimal esti- 
mator for the true value 9 needs prior information on the 
value 9, which might be collected by prior estimations as- 
suming prior probabilities on the unknown parameter, so 
that the optimal estimator is considered as an ultimate 
limit allowed by physics. However, there are some cases 
in which the optimal estimator can be chosen uniformly 
over unknown values of 9 [5] . In these cases the ultimate 
limit can be certainly achieved without prior information. 

From the quantum estimation theory for state param- 
eters mentioned above, we can construct an estimation 
theory for unknown parameters of physical processes, 
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such as coupling constants of the interaction. Suppose 
that we prepare a quantum system in an initial state pi n 
and leave it in an evolution process characterized by an 
unknown parameter 9. Then, the final state p O ut{0) of 
this process depends on the parameter 9. The problem 
of finding the optimal estimation of the parameter 9 is 
solved by maximizing the Fisher information Jg over all 
the possible initial states pi n and all the possible observ- 
able A in the final state [6, 7]. The above physical process 
can be represented by a mapping Tg that transform the 
initial state p in to the final state p out as 

Pout =To[pin]- (8) 

It is now fairly well-known that every general state 
change, called a quantum operation or a quantum chan- 
nel, such as Tg, physically realizable with probability one 
should be a trace-preserving completely positive (TPCP) 
mapping, and conversely that every TPCP map can be 
realized as a unitary process of the system augmented by 
an ancilla prepared in a fixed state as shown by Kraus 
[8, 9]; see also Ref. [10, 11] for the generalization of the 
above statement to generalized measurements and see 
Ref. [12] for the latest elaboration. 

As pointed out in Ref. [13], one can improve the pa- 
rameter estimation if a correlation, or in particular an 
entanglement, is allowed between the input system S and 
an ancilla A. It should be stressed that in doing so one 
needs no physical process to occur on the ancilla sys- 
tem A while the system S passes through the channel 
Tg. In this case, the extended channel is represented as 
Tg ® id a, where id a stands for the identity channel for 
A. Then, the improvement can be achieved by the ini- 
tial preparation of the composite system in an entangled 
state together with the measurement of the composite 
system after the process. 

Recent progress has been reported on problems for spe- 
cial families of quantum channels, in particular, SU(2) 
channel [14], a generalized Pauli channel [15], a general- 
ized amplitude-damping channel [16], U(N) channel and 
its Abelian subgroup channel [17, 18]. A review by Fu- 
jiwara [19] is also available. For earlier contributions see 
also [20, 21, 22, 23, 24, 25]. In this paper, we are devoted 
to the ancilla-assistcd enhancement of Fisher information 
derived by the quantum Cramer-Rao bound, whereas 
ancilla-assisted enhancements have been recently investi- 
gated within the Bayesian approach [26, 27, 28] and the 
minimax approach [29]. 

This enhancement effect not only projects a theoreti- 
cal profundity of quantum mechanics, but also suggests 
many physical applications including the low-noise esti- 
mation in quantum computing, where the enhanced noise 
estimation is expected to contribute to developing the 
quantum error correction and quantum noise reduction 
technology [30]. 

We can find another application of the low-noise esti- 
mation in elementary particle physics. Today, because 
of technological difficulties of high-energy experiments, 
direct researches of new physics far beyond the TeV en- 



ergy scale are almost impossible. This is one of reasons 
why the low-energy rare processes predicted by the new 
physics recently attract much attention. (The CPT sym- 
metry violation in the K — K oscillation is one of the 
typical processes [31, 32, 33, 34].) Clearly, the num- 
ber of signals for the new-physics evidence is predicted 
very small, even if the process really exists in nature. 
The new-physics data should be separated from an enor- 
mous number of ordinary data explained by the standard 
model. This means that the new-physics data can be re- 
garded as a sort of background low noise in the standard 
data. Hence, we can treat the rare process as a low-noise 
channel. It is very significant to estimate the intensity of 
the low noise because indirect information about physics 
beyond the standard model is obtained. In the estima- 
tion, the above ancilla-assisted enhancement may effec- 
tively reduce the trial number of the experiment. 

In this paper, we study the estimation theory of the pa- 
rameter characterizing a small noise in a general quantum 
channel on a system with finite dimensional state space. 
We can always decompose the quantum channel into two 
channels so that the input state of the original channel 
passes through the first noiseless channel and consecu- 
tively passes through the second noisy quantum channel 
called the noise channel. Thus, we can concentrate our 
attention on the noise channel. We are interested in the 
case where the noise is so small that the noise channel 
deviates only a little from the identity channel. In such 
a case, the channel is called a low-noise channel, and 
the parameter representing the noise is called the low- 
noise parameter denoted by e. Let T e be a low-noise 
channel with low-noise parameter e. We assume that 
the low-noise parameter is scaled so that r is the iden- 
tity channel. We can formulate natural mathematical 
requirements for the behavior of the low-noise parameter 
in a neighborhood of e = 0. It is an interesting problem 
to figure out how much ancilla-assistcd enhancement can 
be achievable in the estimation of the low-noise param- 
eter e. In this paper we shall discuss this problem and 
obtain several upper bounds for this ancilla-assisted en- 
hancement factor in the low-noise parameter estimation. 

In Section 2, wc explain a theorem [13] states that the 
Fisher information is attained in a pure initial state, so 
that we can always assume that the input of the channel 
is a pure state. In Section 3, we discuss parameter estima- 
tion for unitary channels, which do not couple with the 
environment, and show that wc have no ancilla-assisted 
enhancement. Thus, the ancilla-assisted enhancement 
is possible only for channels coupled with the environ- 
ment. In Section 4, we introduce the notion of low-noise 
channels mentioned above with rigorous mathematical 
requirements, and wc obtain a general formula for the 
upper bound for the ancilla-assisted enhancement factor. 
In Section 5, we introduce two physical examples of low- 
noise channel. In Section 6, we give a concrete evaluation 
of the enhancement factor in two level systems. Let us 
consider a low-noise channel r £ with low-noise parameter 
£ in a two level system £2. We obtain a universal upper 
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bound for the enhancement factor rj defined by 

C[m&x[J S2+A } pS2+A } 



V 



C[max[Js 2 ] PSa ] 



(9) 



for any finite level ancilla A. Here, ps 2 is the input in 
the system S, Js 2 is the Fisher information of r £ [ps 2 ], 
Ps 2 +A is the channel input in the composite system S2 + 
A, Js 2 +A is the Fisher information of the output states 
(r e ® id,A)[ps 2 +A-i an( i max [']p stands for the maximum 
over all the state p. As shown later, J$ 2 and Js 2 +a shows 
a singular behavior oc 1/e in the e expansion, and C[J] is 
coefficient of cx 1/e, i.e., 



J(e) = -C[J]+0(e ). 



(10) 



The universal upper bound of the enhancement factor 
rj for all the two level systems is given by 



,4 



(11) 



This upper bound is attainable by various channels r e , 
and the corresponding optimal input state is a maximal 
entangled state, and holds for any low-noise channels on 
two level systems. 



II. THE MAXIMUM IS ATTAINED BY A PURE 
INPUT STATE: THE FUJIWARA THEOREM 



Then, it is easy to see that the SLD of p$ is L@ <g) |0) (0| + 
L T (g) |1)(1|, so that the Fisher information of pg is given 

by 

J(pe) = Tr[/5 e (L^|0)(0|+i7®|i)(l|) 2 ] 

= ATrML£) 2 ] + (1 - A)TrML£) 2 ] 

= XJ(a e ) + (1 - A) J(t 6 ). (16) 

On the other hand, the partial trace of pg over C 2 is 
given by 



Tr C 2[p e ] = Acr + (1-A)T . 



(17) 



Since the partial trace is a trace-preserving completely 
positive map, the monotonicity of the Fisher information 
under trace-preserving completely positive maps [35, 36, 
37] concludes 



J(\a 6 + (1 - A)r e ) < J(p e ). 



(18) 



Therefore, from Eq. (16) and Eq. (18) the convexity re- 
lation (12) follows. 

Now suppose that an input state p maximizes the 
Fisher information, i.e., 



Let 



J(r e [p])=max[J(r e [p])] p . 



(19) 



(20) 



In this section we briefly review an important theorem 
due to Fujiwara [13]: the maximum of the Fisher infor- 
mation of output states pg(= ^e[p\) over all possible input 
states p is attained by a pure input state for an arbitrary 
fixed channel Tg . 

To show this following Fujiwara, let Lg be the SLD 
defined by Eqs. (5), (6) for the output state pg. Then the 
Fisher information J(pg) is given by Eq. (4). Fujiwara 
[13] showed that the Fisher information has a convexity 
property, i.e., 

J(\<78 + (1 - X)Tg) < \J{a e ) + (1 - A) J{t ). (12) 

for any < A < 1, where og and Tg are states with 
parameter 9. 

To see the above relation, let Hermitian operators Lg 
and Lg be the SLDs of ag and Tg, respectively, i.e., 



dgCTg = -(LgO-g + O-gLg), 
dgTg = \^{L r gTg + TgLg). 



(13) 
(14) 



Let us consider the tensor product Hilbert space JC = 
H ® C 2 , where H is the state space of S and C 2 is a 
2-dimensional state space. With fixed basis {|0), |1)} of 
C 2 , let pg be a density operator on K. such that 



be the spectral decomposition, where < p n < 1 and 
J2Pn = 1- The output state pg is given by 



pg = T e [p] = J2p n Tg[\n)(n\}. 



(21) 



Pj = Ao-j®|0)<0| + (1-A)tj®|1)<1|. 



(15) 



By using relation (12) repeatedly, we have 

J(p*) < 5>„J(r fl [|n)<n|]). (22) 

n 

Since p maximizes the Fisher information, we also have 
J2PnJ(r0[\n)(n\}) < J(pg), (23) 

n 

and this concludes the relation J(pg) = J(rg[|n)(n|]) for 
all n. Thus, the maximum of the Fisher information is 
also attained by a pure input state. 

From now on, we assume without any loss of generality 
that the input state of the channel is always a pure state 
by virtue of this theorem. 



III. ONE-PARAMETER UNITARY CHANNELS 
HAVE NO ENHANCEMENT 

Before we go to general analysis of low-noise channels, 
let us consider the case where the channel is unitary, or 
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the channel does not interact with the environment. In- 
terestingly, the maximization of the output Fisher infor- 
mation J[pe] with respect to the input p can be explicitly 
accomplished. After the calculation of the maximum, one 
can notice that the ancilla-assisted enhancement does not 
take place at all. The result makes it clear that, in order 
to gain the ancilla-assisted enhancement for channel pa- 
rameter estimations, the channels must have the effective 
interaction between the system and the environment. 

Let U (9) be a unitary operator with an unknown pa- 
rameter 9. Then the output state of the unitary channel 
determined by U (9) for an input state | VP) is given by 

P (o) = \m){*m, (24) 

where the output state \^(9)} is defined by 

m6)) = U{6)\*). (25) 

By introducing the (logarithmic) Hamiltonian operator 
H(9) such that 



H{9) = i(d e U{9))U(9)\ 
H{9)^ = H(9), 



(26) 
(27) 



and using the result in Ref. [5] , the Fisher information of 
the output state is evaluated as 



Js\po]=W BW [H{0)], 



(28) 



where [H (9)] is the variance of H{9) in the state 
*(0),i.e, ' 

V ne) [H{9)] = (*(0)|ff(0) 2 |*(0)) - <*(0)|ff(0)|tt(0)> 2 . 

(29) 

To obtain the maximum of Js, let us consider the maxi- 
mum and minimum of the eigenvalues E n of H(9): 



E ma , x (9) = max[£ n ] n , 



E min (9) = mm[E n ] r 



(30) 



(31) 



Let | max(#)) and | min(#)) be eigenstates corresponding 
to -E m ax(^) and E m i n (9), respectively. By a straightfor- 
ward manipulation, it is easy to see that the maximum 
of Js is taken by a pure input state |$) = U(6)^\$(9)), 
where |3>(0)) is given by 

|$(fl)) = ^[|max(fl)) + |min(fl))]. (32) 

For a fixed value of 9, the maximum is given by 

max[J s ] w = j s [\m)(m\] 

= (#max(0) - E min {9)) 2 . (33) 

To obtain the corresponding result for ancilla-assisted 
estimations, let us introduce an ancilla system A and the 
extended channel defined by 



where \ty) is a state of the composite system S + A to 
be put in the extended channel. For the output state 
p{9) = \^f(9)){9(9)\, the Fisher information is given by 



j s+ A\p(0)] = 4Vi l0) me)®i A ], 



(35) 



where Vy,^\H(9)®l A \ is the variance of H(9)®1 A in the 

state ^f(9). Note that the maximum and minimum of the 
eigenvalues of H £g> 1a are taken in the states | max(#)) \a) 
|min(0))|a), respectively, with an arbitrary ancilla state 
ja), i.e., 

H(0)®l A \msx{e))\a) = E maK (9)\ max(0))|a), (36) 
H(9)®l A \mm(9))\a) = £ min (0)| min(0))|a). (37) 

Hence, the input state given by 



|$) - ^=(U(9)^l A )[\m^(9))\a) + \mm(9))\a) 

= ma) 



(38) 



takes the maximum value of Js+A, which turns out to be 
the same as that given in (33), i.e., 



max[J s+A ] |f> - J s+ a[M0)){*(9)\] 

= (E ma x(9) — E m i n (9)) 2 . 



(39) 



|tt(0)> = (U(9) ® 1 A )|*>, 



(34) 



Consequently, no enhancement by the ancilla extension 
is observed in this unitary case, i.e., 

max[Js+A}p s +A = max [ J s+A}\^ s+A ) = 1 . 4Q . 
max[ J s ]p S max[J s ]|* s ) 

It should be noted here that the above argument 
applies only to one-parameter unitary channels, for 
which Eq. (28) can be applied, whereas a generaliza- 
tion to Abelian group parameters may follow. For mul- 
tiple phase parameter estimation of unitary channels, 
Ballestcr [17, 18] showed, ancilla-assisted enhancement 
actually takes place, whereas for commuting phase pa- 
rameter estimation no enhancement occurs. 

Within Bayesian approach, Chiribella, D'Ariano, and 
Sacchi [26] showed that unitary channels with non- 
Abelian group parameter can have ancilla-assisted im- 
provement of a large class of cost functions. In this con- 
nection, Sacchi [27, 28] gave extensive analysis on the 
condition for ancilla-assisted improvement of the error 
probability for discrimination of Pauli channels. 



IV. LOW-NOISE CHANNELS 

In this section, we introduce the notion of a low-noise 
channel T e with unknown parameter e, which takes only 
small values e ~ 0, by requiring a physically natural as- 
sumption of the channel T e for the parameter values near 
e = 0. The small parameter e is assumed to control the 
low noise well enough and is called the low-noise param- 
eter. 
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As mentioned in the introduction, we will focus on 
the ancilla extension of the low-noise channel defined by 
T e ® id a- The ancilla-assisted enhancement factor r\ is 
also defined as the ratio of the Fisher information of the 
ancilla-assisted estimation to that of the original one and 
is analyzed in detail. 

The concept of the noise in a quantum process to im- 
plement a target unitary process can be understood un- 
der the following consideration. Suppose that we would 
like to implement a unitary channel A^ 7 ) for a system S, 
so that the output state corresponding to an input state 



J 



Pin of S is designed to be 

Pout = A (c/) [p m ] = Up m U f . 



(41) 



Without any noise, the unitary operator can be normally 
implemented as 



U = exp[-itH s /h], 



(42) 



where t is the time interval from input to output, and 
Hs is the Hamiltonian of S under control (Fig. 1). 



Pin 



noiseless 
channel A (t/) 



Pout = A (c/) [pi„] 



FIG. 1: Without any noise, a unitary channel A^ can be implemented with a suitable controlled Hamiltonian H$ in a time 
interval t satisfying eq. (42). 



r 



In real life, the system S is coupled weakly with the 
environment E and causes the decoherence that cannot 
be corrected by controlling the Hamiltonian Hs of the 
system S, so that the noise is brought from the envi- 
ronment. Assume that the noise is controlled by one 
unknown positive parameter v. The estimation of the 
noise parameter v often becomes critical in development 
of quantum devices such as quantum computers. 



J 



The total Hamiltonian reads 




Ht, 



Hs + H S e + He 



(43) 



where He is the Hamiltonian of E and Hse is the in- 
teraction Hamiltonian between S and E. Because of the 
noise, the actual output state p' out deviates from the in- 
tended output state p ou t (Fig. 2). 



Pout = Tr B [e- UHt " t (p ln ® pe) e ltH ^] 



Pout = A„ \flin] 



A (U) 



P'out = r„ [A^[ Pin ]] 



FIG. 2: A controlled unitary process, such as quantum computing, usually suffers from noise (the first line). The noisy process 
is described as a TPCP map, a channel A v , parametrized by one noise parameter v (the second line). In quantum theory, the 
disturbed process is equivalently described by a sequence of two channels. The first channel is the originally intended unitary 
channel A^K The second channel r„ describes the genuine noise effect. We call T„ the noise channel (the third line). 
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By using H tot , the output state p' out is determined in 
principle by 



Tr B [e~ itH ^ ( Pin ®p E )e itH <°*], 



(44) 



where Tr^ is the partial trace over E and pe is the initial 
state of E. Theoretically it is preferable that we deter- 
mine the value of the noise parameter v via Eq. (44); 
however, the explicit calculation of Eq. (44) is too com- 
plicated to perform in many cases. Hence, adopting a 
reasonable theoretical model of the noise effect, the ac- 
tual value of its noise parameter of the model should be 
experimentally estimated. 

Without assuming any detailed knowledge about He 
and Hse, it is natural to represent the noisy process by 
a TP CP map A„ such that 



Pout — ^v[pin]i 



(45) 



where the relation A = A^) holds as the noiseless case. 
In quantum theory, the channel A„ can be equivalently 



J 



described by a sequence of two channels (the third line of 
Fig. 2). The first one is the target unitary channel 
and the second represents the genuine noise part. This 
means that the general noisy process is equivalent to the 
noiseless unitary process followed by an instantaneous 
noise process. The second channel is called the noise 
channel T„ and defined by 



Y v [p] :=k v [tfpU]=k v [{kW)- 1 [p]}. 



(46) 



Using the definition and the ideal output state p ou t , it is 
possible to write the actual output state p' out such that 



v v [u Pm u^ = r u [ Po 



(47) 



When the noise vanishes, the channel reduces to the iden- 
tity channel: 



ids. 



(48) 



IV 



Pv out — IV [pin] 



(A (C7) ) _1 [p 



■in J — Pin 



P ' v ,out=k v [(A^)- 1 ^]] 



FIG. 3: In order to experimentally generate the output state p„ l0 ut of the noise channel r e for an arbitrary input state pi n , we 
take p' in = (A^') -1 ^^], which is independent of v, as the input state for the actual channel k v . 



r 



It is stressed that despite that the noise channel Y v 
is conceptual constituent, it can be simulated in a real 
experiment by use of the actual channel A„ (Fig. 3). In 
fact, the output state of the channel Y v defined by 



Pu.out — r^[p^ n ] 

is exactly reproduced by 

p v ,out = K[{k^)- 1 [ Pin \l 



(49) 



(50) 



for an arbitrary input state pi n . Therefore, by adopting 
a known state p' in — (A^) -1 ]/)^], which is indepen- 
dent of v, as the input state of the actual channel A„, 
we experimentally obtain the output state p v ^ out of the 
noise channel Y v . This aspect sounds very significant. 
Actually, we can replace, not only theoretically but also 
experimentally, the estimation problem for a given real 
channel A„ into the equivalent estimation problem for 
the noise channel Y v . Hence, we later concentrate on es- 



timation of the noise parameters for Y v which satisfies 
relation (48). 

Next let us define mathematically the low-noise chan- 
nel r e . This is a kind of the noise channel and its noise 
parameter v takes small positive values, which is denoted 
by e. We call e the low-noise parameter. Physically, r e 
is expected to have an analytic e dependence near e = 0. 
A rigorous mathematical formulation of this requirement 
is given as follows. 

Since the low-noise channel r e is a TPCP map, it has 
a Kraus representations determined by a family of Kraus 
operators. We shall define low-noise channels in terms of 
their Kraus operators. A family of TPCP maps r e with 
one parameter e > is called a low-noise channel with 
low-noise parameter e if each r £ has a Kraus representa- 
tion 

Y e [p] = Y J B a (e)pBl(e) + e £ C a (e)p c£ (e) (51) 
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with two classes of Kraus operators {B a (e)} and 
{y/eCa(e)} satisfying the following conditions: 

(i) B a (e) is analytic at e = 0, so that we have the power 
series expansion 



(52) 



71=1 



in a neighborhood of e = 0, where n a and are con- 
stant coefficients and operators, respectively, indepen- 
dent of e. The noise channel condition in Eq. (48) re- 
quires 



£Kl 2 = i. 



(53) 



(ii) C a {e) is analytic at e = 0, so that we have the 
power series expansion 



C a (e) = M a + J2Mi n h n , 



(54) 



n=l 



(n) 

in a neighborhood of e = 0, where M a and M a are 
constant operators independent of e. 

Needless to say, the Kraus operators satisfies the trace- 
preserving condition 

Is = Y J Bl{e)B a {e)+eY J Cl{e)C a {e) 1 (55) 



where I5 is the identity operator. By definition, the 
relation 



lim r e 



ids 



(56) 



is automatically satisfied. 

It should be emphasized that our definition of the low- 
noise channel is general from the physical point of view. 
Except that T e satisfies Eq. (56) and has analytic depen- 
dence of e near the origin, the channel T e can be said to 
be a general quantum operation acting on the input state. 
Therefore, the low-noise channel should be always found 
in the weak-interaction limit of H$e for rather general 
physical processes. 

A useful comment is given here. Expanding Eq. (55) in 
terms of e generates a lot of recursion relations between 
n a , and M a n \ The higher components of the oper- 
ators and the coefficients are determined recursively and 
systematically by solving the equations using their lower 
components. The first-order relation in the e expansion 
of Eq. (55) is given by 



]>> a A^)t + <ivW) 



(57) 



One of our fundamental interests is to ask a question: 
which input state for the low-noise channel does maxi- 
mize the Fisher information of its output state pP. By 
virtue of the theorem reviewed in Section 2, the opti- 
mal input state is a pure state. Denote the input state 



by \4>){4>\. Then, from Eq. (52) and Eq. (54), p e can be 
expanded as 

Pt :=TM{<i>\\ = \m\-m+0{e 2 ). (58) 

Here p\ is given by 

a 

-Y,M a \<t>){mt (59) 

a 

For this output state p t , we perturbatively solve the equa- 
tion, 



d ePe = ^(L e p e + p e L e ), 



(60) 



in order to get the SLD operator L t . It is possible 
to check that the following solution actually satisfies 
Eq. (60) by substitution. 



- Pi + O(e). 



(61) 



By substituting Eq. (61) into the definition of the Fisher 
information, we get the value of the information such 
that 



Js[pe] = Tr[p £ L2] 



1<0M0>+O(e°). (62) 



By using Eq. (57) and Eq. (59), the Fisher information 
is evaluated in the leading order of e as 

Js be] = - e J2 {(<l>\M a M a \cb) ~ MM a m 2 ] + 0(e°). 

a. 

(63) 

From Eq. (7) the optimal output-measurement observ- 
able A opt for any input state is given by A opt p e = 
(Js [p e ] 1 L t + e)p e . The optimal input state \<j> pt) can 
be determined by maximizing Js[pe] with respect to the 
state \4>). 

Let us next discuss the low-noise channel in the ancilla- 
extended system S+A. Its extended channel is now given 
by r e <g)zcU. The input pure state |^") can be decomposed 
into 



(64) 



71=1 



where {|n}} is an arbitrary orthonormal basis of S and 
|A„)'s are normalized pure states of A, which arc not 
necessarily orthogonal to each other. The constants C n 
should satisfy the normalization condition: 



1. 



The SLD for the extended state is given by 
L e = 1(1 -|*)<*|) + 0(e°). 



(65) 



(66) 
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The Fisher information Js+A[p~e] for the output state 
p t = T e (g) id J 4[|W)( 1 S r |] is also evaluated in a similar man- 
ner. We have 



Js+AlPe] = - e Yl [Tr[pMtM Q ] - |Tr [pM a ]f 



0(e°), 
(67) 



where p is a state of S defined by 

p = Tr A [|*><*|] = ]T C* n {A n \A n )C n \n){n\. (68) 

nn 

The optimal output-measurement observable A opt for 
any input state |*) is given by A opt p t = {J s +A[p f \" 1 L e + 
el)/5 e . The optimal input state \^ pt) for the extended 
system is determined by maximizing Js+a[p<l\ with re- 
spect to p = Tr j4 [|*)(*|]. 

If the dimension of A is not less than that of S, we are 
able to make |A n )'s orthogonal to each other: 

{A n \A n ) = 5 nn . (69) 

Then the state p is reduced into a form such that 

P = J2\°n\ 2 \n)(n\. (70) 

n 

Note that the orthonormal basis {|n)} of £ and the 
coefficients C„ can be arbitrarily chosen except that 
^|C„| 2 = 1. Hence, p in Eq. (70) is able to describe 
any possible state of S. Therefore, the dimension of the 
ancilla Hilbcrt space suffices to be at most the same as 
the system Hilbert space. 

By combining both results of J$ and Js+a, we have 
the ancilla-assisted enhancement factor r] such that 



max 



E Q Tr[p 5 MtM Q ] -\T r [p s M a ] 



max 



£ Q (<j>s\MtM a \<j> s ) - \(<j>s\M a \<j> s ) 



I0s> 

(71) 

Here max[ ] ps means the maximum value over all possible 
states of S and max[ ]u s \ the maximum value over all 
possible pure states of S. 

Because the set of pure states of S is a subset of the 
set of states of S, the following inequality trivially holds: 



T] > 1. 



(72) 



A. Isotropic Depolarizing Channels 

An isotropic depolarizing channel is given by 

TM = U--Ap + -^Y. a -P a - ( 73 ) 

^ ' a=l 

This is a well known example induced by quantum noise. 
The parameter e(> 0) is just a probability that the qubit 
system becomes depolarized. The Kraus operators in 
Eq. (52) and Eq. (54) are given by 



1/2 

D ( ,(<) - [l-\e) Is. 



(74) 
(75) 



where a = 1,2,3. Hence the expansion coefficients in 
Eq. (52) and Eq. (54) are given by 



(76) 
(77) 



Kq 


= 1, 




= gls, 




(2n-3)!! (3 


1 


M a 


= 2^ a , 


MM 


= 0. 



(78) 

(79) 
(80) 



In this case, the Fisher informations have been already 
calculated [13]. For the isolated original system S, the 
information is independent of the input state and given 

by 



Js 



1 



e(2-e)' 



(81) 



For the extended channel r e ® id a, the optimal input 
state is the maximally entangled state and the informa- 
tion is given by 



Js+A 



e(4- 3e)' 
as long as the parameter e is small. 



(82) 



V. EXAMPLES OF LOW-NOISE CHANNELS 

Low-noise channels introduced in the previous section 
are found in a lot of applications. Checking that low- 
noise channels really appear in some physical phenom- 
ena may lead to a deeper understanding. Thus we give 
two critical examples in this section. The details of the 
channels we introduce below can be seen in Ref. [30]. 



B. Generalized Amplitude-Damping Channels 

A generalized amplitude-damping channel is given by 

2 2 

T[p] =^B a (6)pBt( e ) + e ^C Q (6) P Ct( e ), (83) 

a— 1 a— 1 
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where B v and C a are given by 
Bi(e) = \l 



1 



f)E 



1 
a/T - 



Ci(e) 
C 2 (e) 



-/3B 



1 + e-^ B 



1 



1 + e-P E 



VT~e 
1 



1 




-0E 



1 + e-f* E 





1 



(84) 

(85) 
(86) 
(87) 



The channel describes a relaxation process of the two- 
level system driven by a finitc-tcmpcraturc thermal bath. 
The temperature is (fcs/3) -1 where ks is the Boltzmann 
constant. Here the small noise parameter e is related with 
the survival rate s of the initial state under the relaxation 
such that e = 1 — s. The rate s is given by s = e~ 7 *, 
where t is time and 7 the relaxation rate constant. The 
corresponding coefficients in the e expansion are given by 



1 



Kl = 



K2 



1 + e-? E ' 



o-9E 



l + e-P E ' 



N (n) 



Ml = 



M, 



M {n) - 

m l,2 — 



1 1 (2n-3)!! 
1 + e~P E 2™d 

; e-P E (2n-3)!! 
1 + e-P E 2"n! 





1 

1 




1 



1 + e~P E 



-fiE 



1 + e~P E 



1 




1 



(88) 
(89) 
(90) 

(91) 
(92) 

(93) 
(94) 



The calculation of the Fisher information has been per- 
formed in Rcf. [16]. 

The two examples in this section will be discussed 
again in Section 6. 



VI. CHANNELS ON TWO-LEVEL SYSTEMS 

In this section we concentrate on a two-level system S2 
and an arbitrary ancilla system A. The dimension of A 
is not necessarily two, but assumed finite. Let us derive 
a universal bound on the ancilla-assisted enhancement 
factor 77 such that 

V<1- (95) 

The bound must hold for all low-noise channels of £2. 

As well known, any state p of the two-dimensional sys- 
tem S2 can be written by 



p=-l s + - x .a 



(96) 



where a is the Pauli matrix vector and the three- 
dimensional real parameter vector x takes values which 
satisfies 



< \xV < 1. 



For pure states, the vector is normal: 



|.x| 2 = 1. 



(97) 



(98) 



Similarly, the matrix M a in eqn(54) is uniquely ex- 
panded as 



M a = m a0 l s + ^2 m «°' J °- 



(99) 



Now let us define complex vectors j2 a (a — <~ 3) by using 
the coefficients m aa in Eq. (99) as 



A4 = (m aa ). 



(100) 



In the vector space, there exists a natural inner product 
defined by 



(101) 



A metric is also induced naturally from the inner product 
such that 



9ab ■= (Ma,M&) = 3fca> 



(102) 



where a, b = 1 <~ 3. For later convenience, define a real 
non-negative symmetric matrix H by 

H = [h ab ] = [Re g ab ] > 0, (103) 
and a real three-dimensional vector J by 

J = [Ja] = [Im ,g 23 , Im 331,1m g 12 }. (104) 

Here denote by hi, /12, /i3 the eigenvalues of H. Without 
loss of generality, we can assume that 



< hi < h 2 < h 3 . 



(105) 



Assume later that p, a (a = 1, 2, 3) are linearly indepen- 
dent. Even if it is not so, because of the continuity of 
77, we can take three linearly-independent vectors j2 a {t) 
parametrized by a real parameter t such that 



lim# (t) = p a . 



t-»o 



(106) 



In order to get 77, we first calculate the factor r){t) for 
{jl a (t)} and just take a limit as 



lim ri(t) = 77. 
t->o 



(107) 



Note that the linearly independence of {jla} also means 
H > 0. (108) 
This allows us to assume the existence of H -1 . 
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By a simple manipulation, we have 

TrH + JH~ l J- min 



(x + H-K^Hix + H- 1 !) 



\x\<l 



TrH + JH- 1 J - mm 



(x + H- 1 J)H(x + H- 1 J) 
I 



(109) 



l*l=i 



For the original system S, the optimal input state is Then we can prove that the function G satisfies 
given by 



1 1 _ 

-Is + -x opt ■ a 



(110) 



G(0) > G{J). 
To show this, we transform G(J) as 



where x opt is the vector which minimizes (x + 
H~ 1 J)H(x + H~ 1 J) among whole the unit vectors. The 
optimal input state \^>) for the extended system is also 
given as follows. Find a vector X which minimizes 
(x + H^ 1 J)H(x + H^ 1 J) among whole the vectors with 
|x| < 1. Then the optimal state |4 f ) is determined by 
solving the equation 

Tr A [\*)(n = \ls + lX-ff- (HI) 



G( J) = min 



XHX 



\X-H~ 1 J\ = l 



(116) 



By denoting K = H 1 J, the function G is given in the 
diagonal basis of H by 



G = min 



(117) 



J \X'-K'\ = 1 



Note that the relation \K'\ < 1 trivially holds. Also 
Let us consider the case where J = 0. The factor r\ is notice from definition (115) that if J = K' = 0, G takes 



given by 



f] = 



TrH 



\x\<l 



TrH - min 



Eo.fci x a x b h a b 
hi + h 2 + h 3 



1*1=1 



hi + h 2 + h 3 — min 



\x'\ = l 



Here we have made H diagonalized in the last equality. 
Consequently we obtain an expression of r\ such that 

= h 1 +h 2 + h 3 

hi + h 2 + hs - min [h\,h 2 ,hs] ' 

Taking account of hi < h 2 < hs, we can easily prove 
i] < 3/2 as follows. 

hi + h 2 + h 3 



V = 
< 
< 



h 2 + h 3 
2h 2 + h 3 
h 2 + h 3 
3h 3 _ 3 
2h 3 ~ 2' 



(113) 



Next let us discuss the case where J ^6. Suppose that 
(if" 1 J| < 1. Then we have 



(x + H- 1 J)H(x + H- 1 !) 



|x|<l 



0, (114) 



because we can always take a vector x such that x — 
-H- 1 .!. For later convenience, let us introduce a func- 
tion G(J) as 

G(J) := min \(x + H' 1 J)H{x + H~ x J)l . (115) 
L J|x|=l 



the minimum value of the eigenvalues of H, that is, hi: 
G(0) = hi. (118) 

To compare G(J) with this value hi, suppose a point X' 
on a trajectory defined by \X' — K'\ = 1 such that 

X' a = (k[ ± yfl-(Ktf-(Ktf, 0, o) . (119) 



Then we have 



(120) 



X' HX> = hi (k[ ± y/l (K' 2 Y (K<)^ 

Here we fix the double sign in the above equation so as 
to satisfy the relation: 

X' HX' = hi (\K[\ -y/l- {K' 2 Y (K^ 2 . (121) 

Since the relation \K'\ < 1 holds, it is guaranteed that 

1> (l^l-V 1 -^) 2 -^) 2 ) 2 - (122) 
Therefore the important inequality 
G(0) > G(J) 
really arises as follows. 
G(0) = hi 



> hi (\K[\ y/l [K' 2 f (Ktf) 

G(J). 



> min 



I'M' 



\X'-K'\ = 1 



(123) 

X'oHX'o 
(124) 
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Note that 



and 



JH~ 1 J> 



TrH - G(0) = h 2 + h 3 > 0. 



(125) 



(126) 



Keeping Eq. (123), Eq. (125) and Eq. (126) in mind, 
let us go back to the proof of rj(J) < r/(Q). By using 
Eq. (114), we have 



TnH + JH~ 1 J 
TrH + JH-^J-GiJ) 



(127) 



By replacing G(J) by G(0) in the above equality, from 
Eq. (123) we obtain 



n 



(J) < 



TrH + JH^J 



[TrH - G(0)] + JH^J 
By using an inequality such that 



b + e < b 
a + e ~ a 



for a < b and e > with e = JH 1 J, we have 

TiH 



TrH - G(0) 
Consequently, we have obtained the bound 
V(J) < ?7(0) < 3/2. 



(128) 



(129) 



(130) 



(131) 



For the remaining case where \H 1 J\ > 1, the problem 
becomes much trivial. This is because 



(x + H~ 1 J)H(x + H- 1 J) 



mm 



S\<1 

(x + H- 1 J)H(x + H- 1 J) 



l*l=i 



(132) 



holds in this case. Therefore the relation 77 = 1 is satisfied 
in Eq. (109). 

Therefore, for all the possible low-noise channels, the 
bound rj < 3/2 has been proven. The equality 77 = 3/2 
can be attained by the channels satisfying 



gab OC S ab 



(133) 



with the maximally- entangled input pure states of S + A. 

The optimal input state depends on the vector J of 
the channel. When J = 0, the optimal input state is the 
maximally entangled state. If \H~ 1 J\ > 1, a factorized 
input state takes the maximum and gives r] = 1. When 
1 > \H~ 1 J\ > 0, the optimal input state is neither the 
maximally entangled state nor the factorized state. From 
the argument below Eq. (114) the output state \ip)s+A 
satisfies 



Tr A [\ip) s+a(iP\ 



1 



1 



S+A\ 



JH^a. 



(134) 



The value of r\ given by Eq. (127) also changes continu- 
ously between 1 < r\ < 3/2 depending on |_ff _1 J|. 

The channel dependence of the optimal input state has 
been already noticed in a generalized amplitude-damping 
channel [16] by changing the temperature of the ther- 
mal bath. Because of the simplicity of the model, it is 
possible to estimate the unknown parameter even in a 
finite parameter region. On the other hand, in this pa- 
per, the parameter region of the low-noise channel is con- 
strained to a neighborhood of a fixed value (e = 0). How- 
ever, we would like to stress that our channel includes an 
enormous number of degrees of freedom corresponding 
to K a , N a n ^ and compared with the generalized 

amplitude-damping channel. 

Note that the isotropic depolarizing channel (Eq. (73) 
in Section 5) is one of the channels attaining the bound 
{rj = 3/2). The vectors j2 a are calculated as 



1 



Mi = 



H2 = 



M3 



The corresponding matrix g a b is just evaluated as 

1; 

9ab — -^°ab- 



(135) 
(136) 
(137) 

(138) 



Thus the channel can achieve r\ = 3/2. 

On the other hand, the generalized amplitude-damping 
channels Eq. (83) in Section 5) cannot achieve the bound. 
The vectors jx a are now described by 



Mi = o 



1 



l+e- 



= 



_e Z £E 

l+e-C* E 



1 

l+e-f^ 





(139) 



(140) 



(141) 



The corresponding g a b is now given by 

„-l-e-' 3E 



[9ab 



1 









1 




(142) 



Because g a b S ao docs hold, the channel cannot satisfy 
r/ = 3/2 for any parameter value. In spite of the ancilla 
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extension, the ancilla-assisted enhancement does not ap- 
pear at all (rj = 1), as long as the low-noise parameter 
e is small enough. This is because the value of \H~ 1 J\ 
diverges and the relation \H~ 1 J\ > 1 always holds. 
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